New Methods of OAE signal analysis


A Minimum Variance Spectral Estimation-Based Time-Frequency Analysis for Click-Evoked Otoacoustic Emissions

 

Z. G. Zhang 1,3 , V. W. Zhang 2, S. C. Chan 3, B. McPherson2, Y. Hu 1


1 Department of Orthopaedics and Traumatology The University of Hong Kong, Pokfulam Road, Hong Kong

2 Division of Speech and Hearing Sciences The University of Hong Kong, Pokfulam Road, Hong Kong

3 Department of Electrical & Electronic Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong



 

Dr Z. G. Zhang received his B.Sc degree from Tianjin University, China, in 2000, and graduated with an M.Phil degree in electronic engineering from the University of Science and Technology of China in 2003. He obtained his Ph.D. degree in the Department of Electrical and Electronic Engineering at the University of Hong Kong in 2008. He was a research assistant in Department of Orthopaedics and Traumatology, the University of Hong Kong, during 2008. He is currently a postdoctoral fellow in Department of Electrical and Electronic Engineering at the University of Hong Kong. His research interests are in general area of digital and biomedical signal processing.


1. Introduction

          

Traditionally, CEOAE responses are analyzed in the frequency domain via the Fourier transform. This is very useful for a primary description of CEOAE responses and is a method which aligns with clinical needs. However, CEOAEs are nonstationary signals with frequency dispersion along the time axis. Therefore, traditional spectral analysis using Fourier methods may miss some important temporal information. Recently, time-frequency analysis (TFA) of OAE has attracted increasing interest. Various TFA techniques, such as the Wigner-Ville distribution (WVD) [1], the STFT [2], the wavelet transform [3], and the matching pursuit [4], have been applied systematically to study the characteristics of different types of OAE signals. The WVD has high time-frequency resolution but it often results in severe cross-terms. The STFT originates from the Fourier transform and it faces a basic time-frequency resolution tradeoff problem, as it cannot achieve good time resolution and frequency resolution simultaneously. Wavelet transform has been proved to an effective technique for CEOAE analysis, but it has a degraded frequency resolution for high-frequency components and a degraded time resolution for low-frequency components. Matching pursuit can overcome limitations in wavelet transform by using a greedy search to expand the signal into a set of basic atoms, but its computational complexity is too heavy. In this study, a new nonparametric TFA method based on minimum variance spectral estimation (MVSE) is introduced. The MVSE is a spectral estimation algorithm which can provide higher resolution than other nonparametric spectral estimates such as the periodogram [5]. Using the same sliding window operation as in STFT, the conventional MVSE in the frequency domain can be extended to a windowed MVSE (WMVSE). Inspired by the window selection criterion of wavelet transform, we address the window selection problem of WMVSE by using a long window at low frequencies and using a short window for high frequencies. The frequency-dependent window size produces the frequency-dependent WMVSE (FDWMVSE), which can be used to analyze the time-frequency characteristics of CEOAEs.

2. Materials and Mthods

2.1. Experimental protocols

        A total of 100 neonatal CEOAE responses used in the experiments were recorded from well-baby nurseries at the Hong Kong Adventist Hospital. The average ambient room noise level with OAE equip ment in operation was less than 50 dBA. All the data fulfilled the following CEOAE criteria: stimulus stability; 75%; whole wave reproducibility ³ 70%; overall CEOAE response ³ 5 dB SPL; at least three of five test frequency bands centered at 1, 1.4, 2, 2 .8 and 4 kHz with SNR ³ 3 dB. OAE data were collected using ILO USB equipment plus V6 OAE clinical software (Otodynamics Ltd., UK). Nonlinear “ QuickScreen ” mode was used for data collection ( stimulus level of 75-80 dB peSPL ; analysis window of 12. 8ms ). Res ponse stopping criteria for CEOAE measurements required at least 70 OAE stimuli presentations, or, if no clear CEOAE response at 70 presentations, up to 260 responses were obtained.

2.2. Frequency-dependent windowed m inimum variance spectrum estimation (FDWMVSE)

          In MVSE, an adaptive filter is used to minimize the energy of other frequency components except for the desired frequency component, hence the name minimum variance. By recording the signal energy of the adaptive filter designed for different freq uencies, an energy spectrum of the input signal is obtained. Suppose the OAE signal is and the filter order is , the MVSE at a frequency of interest is calculated as

 

,

(1)

where is the by autocorrelation matrix of and .

To track the time-varying frequency contents of nonstationary CEOAEs , a windowed MVSE (WMVSE) is proposed and its idea is 1) to slide the center of a -size window to each time instant , 2) to window the input signal to yield -size data segments , 3) to calculate the autocorrelation matrix from , and 4) to obtain the WMVSE at time instant as

 

.

(2)

         The time-frequency resolution tradeoff proble m also exists in WMVSE, and the WMVSE with a fixed and for different frequency bands is not a good method . Motivated by the wavelet transfor m that uses adaptive and , a new frequency-dependent WMVSE (FDWMVSE) method is proposed (see reference [ 6 ] for full details) . The FDWMVSE uses frequency-dependent windows: shorter windows for short-duration hi gh-frequency CEOAE components to achieve good time resolution and large window sizes for long-duration low-frequency components to obtain high frequency resolution. More precisely , in the FDWMVSE method , the frequency-dependent window size is adjusted to be inverse proportional to frequency . The relation can be approximately expressed using a linear function as , where and are parameters fo r varying the window size function. In the study, with the help of a synthetic CEOAE model [3] , we estimated these parameters in a trial and error manner and determined the window size function . Then, the frequency-dependent order was chosen as .

 

3. Results

      One real CEOAE sample is illustrated in Fig. 1 to show the effectiveness of the proposed FDWMVSE method . For comparison , the analysis results of the wavelet transform (scalogram) method are also shown . In Fig. 1 , two components located between 3.5 kHz and 4 kHz cannot be identified in the scalogram but they are shown in the result of FDWMVSE. Based on the periodogram and their positions, these two high-frequency components should not be spurious .

MATLAB Handle Graphics

                                                                  Fig. 1 . Time-frequency representations of one real CEOAE signal using FD WMVSE and scalogram .

 

         The performance of the proposed FDWMVSE method was further assessed and compared with wavelet transform quantitatively by the parameters of peaks det ected in the time-frequency distributions. To study the characteristics of CEOAE components in different frequency bands, the whole frequency range was divided into six bands: 0 < f <= 1 kHz, 1 kHz < f <= 2 kHz, 2 kHz < f <= 3 kHz, 3 kHz < f <= 4 kHz, 4 kHz < f <= 5 kHz, and f > 5 kHz. In each frequency band, the location of its peak was detected as the time-frequency point with the maximum value of the time-frequency distribution in the f requency band . The latency and frequency of peaks can reveal meaningful information regarding CEOAE components. Further, t o compare the frequency resolution of various TFA method s , the peak width, which denotes 80% energy density of the peak power in each frequency band , was measured in both time and frequency dimensions. A narrow peak indicates a relatively high resolution, whereas a wide peak means low resolution.

          Based on the peak detection approach , the number of peaks detected in scalograms and FDWMVSE s among 100 real CEOAE signals are illustrated in Fig. 2 , where the “ shared ” peaks mean that these peaks can be detected in both scalograms and FDWMVSEs

                                                            Fig. 2 . Number of peaks detected in time- frequency representations among 100 real CEOAE signals using scalogram and FDWMVSE.

 

The peak distributions and the regression lines are illustrated in Fig. 3 . Using the line identification proposed in [ 7 ] , where the relation of peak latency τ (ms) and peak frequency f (kHz) was expressed as τ = κ f λ , we can obtain τ = 8.38 f -0.26 for scalogram and τ = 8.45 f -0.28 for FDWMVSE.

MATLAB Handle Graphics
                                         Fig. 3. Distribution of peaks detected in scalogram and FDWMVSE for 100 real CEOAE signals : the upper figure shows the peak locations and the regression results and the lower figure shows
                                                   the mean and standard deviation values of peak locations at each of five frequency bands.

 

                 Last, the peak width parameters in time and frequency dimensions using scalogram and FDWMVSE are shown and compared in Fig. 4 .



                                                                                         Fig. 4 . P eak width values in time- frequency representations of 100 real CEOAE signals using scalogram and FDWMVSE.

4. Discussion

        The performances of the FDWMVSE and scalogram were evaluated and compared using the number of p eaks detected in the time-frequency distributions among all of the 100 real CEOAE data. It can be seen from Fig. 2 that, t here were most peaks in frequency band 1-2 kHz for the two methods , and FDWMVSE reveal ed slightly more power peaks (more CEOAE components) especially in frequency band 2 - 4 kHz . The reason may be that the FDWMVSE method has narrow peak widths in the frequency domain and hence it can distinguish adjacent CEOAE components in higher frequency bands.

        The distributions of peaks detected by FDWMVSE and by scalogram were also quite similar in the time-frequency domain , as seen from Fig. 3 . There were no significant differences ( p >0.05) for peak time and peak frequency between the two TFA methods. On the other hand, except for the peak width in t he frequency dimension at the frequency band 0 – 1 kHz, all other peak width values in time and frequency dimensions show significant differences ( p < 0.05) , with the peak width values of the FDWMVSE method being obviously smaller than those of scalogram. Fi g. 4 also shows that the differences of peak width in time dimension between the two methods decrease with frequency, while the differences in peak width in the frequency dimension increase with frequency. T he current results imply that the proposed FDWMVS E method has good time-frequency resolution, and this method is expected to have potential applications in CEOAE studies.

5. Conclusions

         The proposed FDWMVSE method is a good time-frequency analysis method for CEOAE because it employs frequency-dependent win dow sizes to better match the time-frequency characteristics of CEOAEs. By quantitative analysis and comparison, we can tell that the proposed FDWMVSE method has a similar performance to the classical wavelet transform in identification of peak locations at each frequency band. In higher frequency bands, FDWMVSE was able to identify more peaks due to its high frequency resolution. The FDWMVSE method can also offer smaller and narrower peak scopes than the wavelet transform.

 Acknowledgement


      This study was partially supported by the University Grants Council, Hong Kong, GRF Grant HKU7434/04M.

6. References

[1] Cheng, J., 1995. Time-frequency analysis of transient evoked otoacoustic emissions via smoothed pseudo Wigner distribution. Scand . Audiol. 24(2) , 91-96 .

[2] Hatzopoulos , S., Cheng , J. , Grzanka , A. , Martini , A., 2000. Time-frequency analyses of TEOAE recordings from normal and SNHL patients. Audiology 39(1) , 1-12.

[3] Tognola, G., Grandori, F., Ravazzani , P., 1998. Wavelet analysis of click-evoked otoacoustic emission. I EEE Trans . Biomed . Eng . 45 (6) , 686 - 697 .

[4] Jedrzejczak, W. W. , Blinowska, K. J. , Konopka, W. , 2005. Time-frequency analysis of transiently evoked otoacoustic emissions of subjects exposed to noise. Hear . Res . 205(1-2) , 249-255 .

[5] Kay, S. M. , 1987. Modern Spectral Estimation: Theory and Application . Prentice-Hall , Englewood Cliffs, NJ.

[6] Z hang , Z. G., Zhang, V. W., Chan, S. C., McPherson, B., Hu, Y., 2008. Time- frequency analysis of click-evoked otoacoustic emissions by means of a minimum variance spectral estimation-based method . Hear . Res . 2 43 (1-2) , 18-27 .

[ 7 ] Sisto, R., Morleti, A., 2002. On the frequency dependence of the otoacoustic emission latency in hypoac oustic and normal ears . J. Acoust. Soc. Am. 111(1) , 297 - 308.